The PAPILLON Project: Cooperatively Building a Multilingual Lexical Data-base to Derive Open Source Dictionaries & Lexicons

نویسندگان

  • Christian Boitet
  • Mathieu Mangeot
  • Gilles Sérasset
چکیده

The PAPILLON project aims at creating a cooperative, free, permanent, web-oriented and personalizable environment for the development and the consultation of a multilingual lexical database. The initial motivation is the lack of dictionaries, both for humans and machines, between French and many Asian languages. In particular, although there are large F-J paper usage dictionaries, they are usable only by Japanese literates, as they never contain both original (kanji/kana) and romaji writing. This applies as well to Thai, Vietnamese, Lao, etc.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The PAPILLON Project: Cooperatively Building A Multilingual Lexical Data-Base To Derive Open Source Dictionaries And Lexicons

The PAPILLON project aims at creating a cooperative, free, permanent, web-oriented and personalizable environment for the development and the consultation of a multilingual lexical database. The initial motivation is the lack of dictionaries, both for humans and machines, between French and many Asian languages. In particular, although there are large F-J paper usage dictionaries, they are usab...

متن کامل

Papillon Lexical Database Project: Monolingual Dictionaries & Interlingual Links

This paper presents a new research and development project called Papillon. It started as a French-Japanese cooperation between laboratories GETA/CLIPS (Grenoble, France) and NII (Tokyo, Japan). Its goal is to build a multilingual lexical database and to extract from it digital bilingual dictionaries. The database is built with monolingual dictionaries, one for each language of the database, li...

متن کامل

The translation of examples , citations , definitions and glosses in the Papillon project

The Papillon lexical data base comprises a set of detailed monolingual dictionaries of « lexies » (word senses) interlinked through « axies » (interlingual links) which can also refer to external semanticsoriented systems such as UNL « universal words », Worldnet « synsets », Ontos « concepts » or NTT ALT/JE system « semantic classes ». The basic idea is that bilingual or multitarget usage dict...

متن کامل

Data Repository Organization and Recuperation Process for Multilingual Lexical Databases1

This paper describes the data management in multilingual lexical databases. Since NLP systems are using lexical data, the amount of work to build them is huge. That is the reason why it is important to use rigorous powerful systems and to be able to get data for a minimal cost. As an open source project, Papillon is reusing existing lexical data and wants to make volunteers collaborate. To make...

متن کامل

Creating a Multilingual Data Collection for Bilingual Lexicography from Parallel Monolingual Lexicons

In the DELIS project 1 , a set of parallel monolingual lexicon fragments have been designed. They are parallel in two ways: (1) they cover the same fragment (the most general verbs of sensory perception and of speech act), and (2) they are based on the same theoretical approaches and on comparable classiications and descriptive devices. It is claimed in this paper that such parallel fragments, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002